Automatic Phonetic Segmentation for a Speech Corpus of Hebrew

نویسندگان

Nikša Jakovljević

Dragiša Mišković

Darko Pekar

Milan Sečujski

Vlado Delić

چکیده

This paper presents our study on different phonetic segmentation methods based on hidden Markov models evaluated against a Hebrew speech corpus. We investigated methods for fully automatic phonetic segmentation using only the corpus which should be segmented and automatically generated phonetic transcriptions. A new method for phonetic boundary correction based on spectral variation of the speech signal is proposed. The proposed method increased the boundary correctness of the baseline HMM segmentation system from 30.2%, 59.5% and 86.2% of automatic boundary marks with error smaller than 5, 10 and 20 ms respectively, to 52.3%, 76.3% and 90.7%.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Automatic Tools for Analyzing Spoken Hebrew

This work summarizes our project to propose a set of automatic tools for analyzing the phonetic and phonological content of spoken Hebrew. The goal of the project is to provide a set of resources to scientists and engineers who work on research and engineering problems related to the acoustics and linguistics of the modern Hebrew language. The set of tools includes: (i) a transcribed corpus of ...

متن کامل

Minimum boundary error training for automatic phonetic segmentation

Annotated speech corpora are indispensable to various areas of speech research. In this paper, we present a novel discriminative training approach for HMM-based automatic phonetic segmentation. The objective of the proposed minimum boundary error (MBE) discriminative training approach is to minimize the expected boundary errors over a set of phonetic alignments represented as a phonetic lattice...

متن کامل

A Minimum Boundary Error Framework for Automatic Phonetic Segmentation

This paper presents a novel framework for HMM-based automatic phonetic segmentation that improves the accuracy of placing phone boundaries. In the framework, both training and segmentation approaches are proposed according to the minimum boundary error (MBE) criterion, which tries to minimize the expected boundary errors over a set of possible phonetic alignments. This framework is inspired by ...

متن کامل

Impact of frame rate on automatic speech-text alignment for corpus-based phonetic studies

Phonetic segmentation is the basis for many phonetic and linguistic studies. As manual segmentation is a lengthy and tedious task, automatic procedures have been developed over the years. They rely on acoustic Hidden Markov Models. Many studies have been conducted, and refinements developed for corpus based speech synthesis, where the technology is mainly used in a speaker-dependent context and...

متن کامل

A Multimodal Corpus of Expert Gaze and Behavior during Phonetic Segmentation Tasks

Phonetic segmentation is the process of splitting speech into distinct phonetic units. Human experts routinely perform this task manually by analyzing auditory and visual cues using analysis software, which is an extremely time-consuming process. Methods exist for automatic segmentation, but these are not always accurate enough. In order to improve automatic segmentation, we need to model it as...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 2012

Automatic Phonetic Segmentation for a Speech Corpus of Hebrew

نویسندگان

چکیده

منابع مشابه

Automatic Tools for Analyzing Spoken Hebrew

Minimum boundary error training for automatic phonetic segmentation

A Minimum Boundary Error Framework for Automatic Phonetic Segmentation

Impact of frame rate on automatic speech-text alignment for corpus-based phonetic studies

A Multimodal Corpus of Expert Gaze and Behavior during Phonetic Segmentation Tasks

عنوان ژورنال:

اشتراک گذاری